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Introduction. In his book,f Steinhaus discusses the problem of ranking n 
objects according to some transitive characteristic, by means of successive pair- 
wise comparisons. In this paper we shall adopt the terminology of a tennis 
tournament by a players. The problem may be briefly stated: “What is the 
smallest number of matches which will always suffice to rank all 2 players?” 

Steinhaus proposes an inductive method whereby, the first & players having 
been ranked, the (k-+1)-st player is matched against the median player in the 
first k, and by a “halving” process is finally ranked into this chain. Then the 
(k-+2)-nd player is ranked into the new chain of k-+1 players in the same man- 
ner. 

Using this process, a player can be ranked into a chain of k others in S(k) 
=1+[log. k] matches. Steinhaus thus shows that M(z) matches always suffice 
for players where 


M(n) = 1+ nS(n) — 28, 


He then states, “It has not been proved that there is no shorter proceeding pos- 
sible, but we rather think it to be true.” 

The purpose of this note is to present an improved procedure, compare it 
with Steinhaus’ M(m) as an upper bound and with a lower bound L(m) derived 
from information theory, and to discuss the asymptotic behavior of these three 
functions for large z. 

A lower bound, L(n), is easily seen to be L(n) =1+ [loge (2!) ], since each 
pairing can do no more than divide the remaining possibilities into two comple- 
mentary sets; the results of the comparison then selects one or the other of these. 
Observing 2! possibilities initially, with halving the best we can do at each 
stage, we are led directly to the above formula for L(). 


The improved procedure. This may be explained inductively in three steps 
(illustrated for the case n=19 by Fig. 1). Suppose 7 =2r or 2r-+1. 


1. Pair off 27 of the players and let the pairs play in the first round leaving 
one man out if # is odd. 

2. By a continuation of the present method applied to 7 players, give a com- 
plete ranking of these r first round winners. 

3. The third step is best explained by a diagram. 


At this point in the ranking we have a hierarchy of the form illustrated in 
Figure 1 for 7=19. First round winners J, J, +++, B are ranked in that order 
with J the best player. A is the first round loser to B, and other first round losers 


* Now at General Analysis Corporation. 
1 H. Steinhaus, Mathematical Snapshots, New York, 1950, pp. 37-40. 
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are indicated directly below their respective victors. The odd man drawing a 
first round bye is considered a loser and put in the position at the extreme left 
of the diagram. 
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The phrase “main chain” initially will refer to the chain JTH - - - CBA, and 
the procedure will be to insert the numbered points in the main chain in the 
order indicated. The procedure is based on the fact that the insertion of a 
single point into a chain by the Steinhaus method is most efficient if the number 
of points on the chain is of the form 2* —1. 

Hence we start by inserting point 1 in the chain ABC. After this has been 
done, the “main chain” under point 2 consists of AB and possibly point 1; this 
insertion can also be performed with two comparisons. 

We now turn to chains of length 2?—1=7, and observe that point 3 is as 
high as we can go, the main chain under 3 being composed of ABCDE and 1 and 
2, and so forth. 

This represents a ranking technique, requiring U(m) comparisons, where 
U(n) is given recursively as follows. 


U(1) = 0, U(2) = 1, 


U(2k) = k + U(k) + 2, T(i),* 


k+1 


U(2k-+ 1) =k+ U(k) + > TU), 


i=2 


* The T(t) are exactly the parenthetical numbers along the bottom of Figure 1. 
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where 
T(i) =2 for 1<i8 3, 
T(t) = 3 for 3<i185, 
Ti) =4 for 5<i8 11, 


ef 


Ti) =7 for tfHii<isS ht, 
and t;= {2#1-+(—1)7}/3. 


We shall now give a table of values of M(n), U(m), and L(n) for some selected 
values of m, thereafter making some empirical observations. 
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It may be observed that U(n)=L(n) for »=20 and 21 also; thus the pro- 
posed procedure is optimal for those values of m in addition to the values 
n<11. We conjecture that the value Z(12)=29 can not be achieved, but it 
seems to be difficult of proof. We further conjecture that U(m) is best possible, 
for all 2, but have no mathematical grounds on which to attack a similar con- 
jecture regarding L(). 


Asymptotic formulae. We state the following formulae for the asymptotic 
behavior of U(n), L(~), and M(n). The proofs are sufficiently cumbersome to 
be omitted. (The formulae for U(m) and M(m) are based on the subsequence of 
n which appear to give local minima for U(n)—L(n), i.e., n= [2*/3]| for some k. 
These being the “best” values, the comparison may be somewhat invidious.) 
The decimal values which appear as coefficients for 7 are, of course, approxima- 
tions. 


M(n) ~ nlogen — .9152 + O(loge n), 
U(n) ~ n loge n — 1.415” + O(loge x), 
L(n) ~ n loge n — 1.443” + O(loge 2). 
On the other hand, for the subsequence of »=2* for some k, which are the 


“best” values for M/(m), the coefficients of ~ in the expressions for M(n) and 
U(n) are —1 and —1.333, respectively. 


* A more correct statement for L() should be “the least integer K 2 logs w!.” Our formula is 
equivalent except for »=2. 


